G-Store: A Storage Manager for Graph Data
نویسندگان
چکیده
Graph data is ubiquitous: Social networks, Semantic Web, pointer analysis in software engineering, and biological and chemical networks all rely on a graph representation of data. This paper makes the case for a native storage layer for graph data, rather than relying on relational or columnar stores. We propose a lightweight storage manager for graph data called G-Store. It exploits the structure of the graph for placement of data in pages that is optimized for a wide range of access patterns found in graph queries. Our placement approach partitions the data into pages using a multilevel partitioning algorithm and arranges the pages on disk to minimize the distance on disk between adjacent vertices. Initial experiments show that G-Store can outperform existing graph database solutions by orders of magnitude. We believe that these results justify a promising avenue of research into storage-aware graph databases. We discuss some of these research directions.
منابع مشابه
StorHouse/Relational Manager (RM) - Active Storage Hierarchy Database System and Applications
This paper describes how database systems can use and exploit a cost-effective active storage hierarchy. By active storage hierarchy we mean a database system that uses all storage media (i.e. optical, tape, and disk) to store and retrieve data and not just disk. We describe and emphasize the active part, whereby all storage types are used to store raw data that is converted to strategic busine...
متن کاملA demonstration of the G∗ graph database system
The world is full of evolving networks, many of which can be represented by a series of large graphs. Neither the current graph processing systems nor database systems can efficiently store and query these graphs due to their lack of support for managing multiple graphs and lack of essential graph querying capabilities. We propose to demonstrate our system, G*, that meets the new challenges of ...
متن کاملThe TileDB Array Data Storage Manager
We present a novel storage manager for multi-dimensional arrays that arise in scientific applications, which is part of a larger scientific data management system called TileDB. In contrast to existing solutions, TileDB is optimized for both dense and sparse arrays. Its key idea is to organize array elements into ordered collections called fragments. Each fragment is dense or sparse, and groups...
متن کاملStoring RDF as a Graph
RDF is the first W3C standard for enriching information resources of the Web with detailed meta data. The semantics of RDF data is defined using a RDF schema. The most expressive language for querying RDF is RQL, which enables querying of semantics. In order to support RQL, a RDF storage system has to map the RDF graph model onto its storage structure. Several storage systems for RDF data have ...
متن کاملCLEO III Data Storage
The CLEO III experiment will collect on the order of 200 TB of data over the lifetime of the experiment. The challenges facing CLEO III are how to store such a large dataset. We will describe our experiences with Objectivity/DB on top of a Hierarchical Storage Manager.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010